Model Selection

Whisper Fine-tuning

# Whisper Fine-tuning

Whisper Large V3 Speech Flow

A speech fluency classification model based on Whisper Large v3, capable of detecting speech fluency and disfluency types

Audio Classification

Safetensors English

Indian Accent English Whisper Finetuned Epoch 15

An Indian English accent speech recognition model fine-tuned based on OpenAI Whisper-large-v3-turbo, achieving a 7.99% word error rate on Indian English accent datasets

Speech Recognition

Transformers English

Whisper Finetuned

Whisper-large-v3-turbo fine-tuned model for Indian English accent speech recognition, with a word error rate of 4.39%

Speech Recognition

Transformers English

Vlzcrz Whisper Small Japanese 2

A Japanese speech recognition model fine-tuned on the Common Voice 17.0 dataset based on openai/whisper-small

Speech Recognition

Transformers Japanese

Voice Clone Large Finetune Final

This model is a voice cloning model fine-tuned based on openai/whisper-large-v3, primarily used for speech recognition tasks, achieving a word error rate of 15.3572 on the evaluation set.

Speech Recognition

Speech Emotion Recognition With Openai Whisper Large V3

This project utilizes the Whisper model for speech emotion recognition, capable of classifying audio into different emotional categories such as happiness, sadness, and surprise.

Audio Classification

Pronunciation Accuracy

A pronunciation accuracy evaluation model fine-tuned based on OpenAI Whisper-base, used to assess speech pronunciation accuracy

Speech Recognition

Whisper Large V3 Japanese 4k Steps

A speech recognition model fine-tuned on the Common Voice 16.1 Japanese dataset based on openai/whisper-large-v3, trained for 4000 steps

Speech Recognition

Transformers Japanese

Haitian Speech To Text

A Whisper-based speech recognition model optimized for Haitian Creole, featuring high-accuracy speech-to-text conversion

Speech Recognition

Transformers Other

Whisper Large V3 Atco2 Asr

A speech recognition model fine-tuned based on OpenAI Whisper-large-v3, specializing in Air Traffic Control (ATCO) scenarios with a word error rate of 17.04%

Speech Recognition

Whisper Small Keyword Spotting

An audio keyword recognition model fine-tuned based on openai/whisper-small, trained on the kw-spotting-fsc-sl-agv dataset with an evaluation accuracy of 99.98%

Audio Classification

FlandersMakeAGV

Whisper Base Japanese

This model is fine-tuned on the Common Voice, JVS, and JSUT datasets for Japanese speech recognition tasks using openai/whisper-base.

Speech Recognition

Transformers Japanese

Whisper Small Ft Common Language Id

A general language identification model fine-tuned based on openai/whisper-small, achieving 88.6% accuracy on the evaluation dataset

Audio Classification

Whisper Medium Fleurs Lang Id

A speech language identification model fine-tuned on OpenAI Whisper-medium, achieving 88.05% accuracy on the FLEURS dataset

Audio Classification

Whisper Large V2 Cv11 German

An automatic speech recognition model fine-tuned on the Common Voice 11.0 German dataset based on openai/whisper-large-v2, supporting German speech-to-text with a word error rate of 5.76

Speech Recognition

Transformers German

Whisper Medium Ar

A speech recognition model fine-tuned on Arabic datasets based on openai/whisper-medium

Speech Recognition

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase